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Abstract 


In this thesis, a lifting based non-linear wavelet transform is proposed. This 
transform changes its filter support size adaptively depending on the data. A 
simple criterion for this adaptive change is proposed. The basic idea is to avoid 
using a large support size filter across the abrupt signal discontinuities(edges). This 
performs better then conventional linear algorithms. The reason being smaller 
number of wavelet coefl&cients is affected because of the edges. This gives an 
effective coding gain. The data itself is used for making the decisions, hence no 
side information needs to be sent along with the data. For coding the transformed 
image the SPIHT coder has been modified. The results of proposed transform are 
much better in case of synthetic test images and comparable in case of natural 
images. Future extensions to the algorithms have been proposed. 



n 



Acknowledgment 


I would like to express gratitude and heartfelt thanks to my supervisor, Dr. 
Govind Sharma. It was because of his guidance, encouragement and valuable 
advice, that this thesis has taken a meaningful shape. 

The word of gratitude should also go to all of my batchmates in ACES-205 
laboratory for the technical discussions we had. In particular I would like to thank 
Ravi and Tushar for helping me with SPIRT coder and wavelet theory. 

I would also like to thank all of my fellow students for making my stay in IIT 
Kanpur a memorable one. 

Last but not least I would like to express my gratitude to wards my family 
without whom I would not have been what I am today. 


Kanpur, India 

June, 2004 (Tushar Padlikar) 


iii 



Contents 


Abstract ii 

Acknowledgment iii 

Contents v 

List of Figures viii 

1 Introduction 1 

1.1 Image compression system 2 

1.2 Overview of Thesis 4 

2 The Lifting Scheme 6 

2.1 Definitions 7 

2.2 Lifting Scheme 8 

2.3 Dual Lifting . . .' 10 

2.4 The Lazy Wavelet 11 

2.5 Predict-Update Architecture 12 

3 Adaptive Lifting 14 


IV 



3.1 Introduction 14 

3.2 Nonlinear Lifting 15 

3.3 Choice of Filters ' 17 

4 Algorithm 21 

4.1 One Dimensional Algorithm 22 

4.2 Two Dimensional Algorithm 1 24 

4.3 Image Coding 26 

4.3.1 SPIHT 26 

5 Results and Discussions 28 

5.1 Results 29 

5.1.1 Synthetic Image 29 

5.1.2 Natural Image 34 

6 Conclusions and Future Scope 42 


V 



List of Figures 


1.1 A General Image Compression System 2 

2.1 Lifting 10 

2.2 Lazy wavelet transform Dual Lifting and Regular Lifting 12 

2.3 Predict-Update Architecture 13 

3.1 Figure showing change of filter support size in proximity of an edge 15 

3.2 Update First Architecture 16 

3.3 H filter 18 

3.4 H filter for (1,3) transform 18 

3.5 H filter for (1,5) transform 19 

3.6 H filter for (1,7) transform • 19 

. ’ t 

4.1 Figure showing inconsistency of Predict-Update approach 22 

4.2 One Dimensional Algorithm 23 

4.3 Two Dimensional Algorithm 24 

4.4 SPIHT tree structure 27 

5.1 Original Synthetic Test Image 29 


VI 



5.2 Reconstructed Image of Adaptive(l,7) Transform BPP 0.30 PSNR 

27.974 30 

5.3 Reconstructed Image of Non-Adaptive(l,7) Transform BPP 0.30 

PSNR 27.776 30 

5.4 Reconstructed Image of Daubechies{9,7) Transform BPP 0.30 PSNR 

20.832 31 

5.5 Reconstructed Image of Adaptive(l,7) Transform BPP 0.50 PSNR 

39.526 31 

5.6 Reconstructed Image of Non-Adaptive(l,7) Transform BPP 0.50 

PSNR 37.565 32 

5.7 R.econstructed Image of Daubechies(9,7) Transform BPP 0.50 PSNR 

32.693 32 

5.8 Reconstructed Image of Adaptive(l,7) Transform BPP 0.59 PSNR 

47.496 33 

5.9 R.econstructed Image of Non-Adaptive(l,7) Transform BPP 0.60 

PSNR 41.508 33 

5.10 Reconstructed Image of Daubechies(9,7) Transform BPP 0.60 PSNR 

37.379 34 

5.11 Original Lena Image 35 

5.12 Reconstructed Image of Adaptive(l,7) Transform BPP 0.29 PSNR 

28.008 35 

5.13 Reconstructed Image of Non-Adaptive(l,7) Transform BPP 0.30 

PSNR 23.442 36 


5.14 Reconstructed Image of Daubechies(9,7) Transform BPP 0.30 PSNR 
23.751 . 

vii 


36 



5.15 B.econstructed Image of Adaptive(l,7) Transform BPP 0.51 PSNR 

31.229 37 

5. 16 Ileconstructed Image of Non-Adaptive(l,7) Transform BPP 0.50 

PSNR 26.228 37 

5.17 Reconstructed Image of Daubechies(9,7) Transform BPP 0.50 PSNR, 

27.676 38 

5.18 P.econstructed Image of Adaptive(l,7) Transform BPP 0.71 PSNR 

32.297 38 

5.19 P,econstructed Image of Non-Adaptive(l,7) Transform BPP 0.70 

PSNR 28.026 39 

5.20 Reconstructed Image of Daubechies(9,7) Transform BPP 0.70 PSNR 

31.198 39 

5.21 PSNR Vs BPP performance comparisons for Synthetic Image . ... 40 

5.22 PSNR Vs BPP performance comparisons for Lena Image 41 


viii 



Chapter 1 


Introduction 


Recent years have witnessed an explosion in the amount of information available in 
the form of digital image data. Take for instance, a database of images consisting 
of fingerprints/photographs in a law enforcement agency like Interpol. A 768x768 
digital gray scale image will take at least 576 kilobytes of space at the rate of 8 
bits per pixel(6pp). The number of images in such a database runs into millions 
and without any image compression algorithm the total database size goes well 
in terabytes. Though the storage/bandwidth constraints are surmounted to a 
great extent an efficient image compression algorithm always adds to the overall 
system performance. The field of Image/ Video compression thus gains a special, 
importance in all the branches of image processing. The following section serves 
aig a general description of the existing methods. 
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CHAPTER 1. INTRODUCTION 2 

1.1 A General Image Compression System 

I here have been many publications [12] attempting to find an optimal solution 
for the data compression problem. A general still image compression system has 
an architecture as shown in figure 1.1. Generally speaking, the image data is 



Figure 1.1: A General Image Compression System 


transformed, depending on the type of compression desired(lossless/lossy) to some 
other domain, so as to separate the redundant and nonredundant parts of the data 
[13] [1 1]. These lYansforms can be again be divided in two broad categories: Linear 
and Non-Linear transforms. 

• Linear Transform : If T’(.) is a transformation acting on vectors x,y ,then 
it is called a linear transform iff, 

T{ax + by) = aT{x)TbT{y) 

for all scalars a,b. Generally speaking, the significant properties of Linear 
transforms are: 

— Lower computational complexity then non-linear counterparts. 


— Lesser prone to errors. 
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T. he Linear Transforms are easier to deal with mathematically, concept 
of frecpiency analysis can be applied to these. 

DCT^DWT2 fall into this category. 

• Non-Linear Transform : The trivial definition of such transforms is the 
absence of linearity. Since these transforms do not have the constraints of 
the linear case there are additional degrees of freedom. These if properly 
exploited can be used to give such data representations which are better 
then linear counterparts for specific applications. The tradeoff is usually in 
terms of greater computational complexity and higher vulnerability to errors. 
Many approaches have been proposed in literature. The details can be found 
in [9] [10] 

The transformed image is then quantized and subsequently coded in the form 

of a bit stream. The quantization step which is shown to be preceding, can be 

embedded in the coding step itself in some cases. If however, it is done as a 

separate stage the siibbands are quantized in an individual manner. This makes 

sure that the perceptually important data has been coded at a higher bit rate. 

The coding scheme depends on the transformation used. SPIHT® is used after the 

image is transformed using DWT, refer section 4.3.1 on page 26 and a combination 

of RLE^, Huffman and differential encoding schemes is used in case of DOT. 

^DCT ; Discrete Cosine Transform 
^DWT : Discrete Wavelet Transform 
'^SPIHT : Set Partitioning in Hierarchical Trees 
^RLE : Run Length Encoding 
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1.2 Overview of Thesis 

In this work a non-linear transformation based on lifting framework [3], [6] is used. 
The additional degrees of freedom obtained by removing the constraint of linearity 
are used in changing the filter supports adaptively. 

One of the problems of the DWT is the cone effect of the edges that is encoun- 
tered at higher levels of decomposition [11]. The. number of subband components 
that are affected by the abrupt signal discontinuities/edges goes on increasing. 
These subband coefficients are large in magnitude and hence it is not possible to 
code them efficiently. The basic idea in this thesis is to make sure that a minimum 
number of coefficients are affected because of the edges present in the images. This 
is done by adaptively detecting the presence of the edges and changing the filter 
support sizes in the proximity if the edges. When such introduction is made in the 
system the problems encountered include 

• Side Information overhead The decisions about the filter usage done at 
the receiver sliould be the same as the ones that are made at the encoder, 
hbr this some side information should also be sent with the coded bit stream. 
This liowever is circumvented by using the coded data for making the filter 
choices 

• Synchronization Since the side information is not being sent there should 
be a robust mechanism to make sure that the decision synchronism is main- 
tained 

• Coding scheme The coding scheme should be lossless. For this the conven- 
tional SPIHT coder has been modified 
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• Error Vulnerability Since the data transmitted is itself used for making ' 
the filter choices there should be robust error protection code for the same. 
Tliis issue however, is not explored in this thesis. 

The report is organized as follows. Chapter 2 discusses the theoretical aspects 
of the lifting scheme, dual lifting and gives an example of lazy wavelet. Chapter 3 
extends the concept of lifting to adaptive lifting. The issues involved are'elucidated 
in this chapter. Cliapter 4. discusses the implementation issues involved. Detailed 
algorithms for one and two dimensional signals are given in this chapter. In Chap- 
ter 5 the results on some test images are discussed. The report concludes with 
Chapter C where possible extensions to this work are discussed. 



Chapter 2 


The Lifting Scheme and Second 
Generation wavelets 


First generation wavelets have become a defacto standard for still image compres- 
sion. Detailed analysis of first generation orthogonal wavelet representations can 
be found in [2], for biorthogonal settings refer [5]. First generation wavelets ba- 
sis functions compact the energy of data in least number of coefficients possible. 
The refuson being their ability exploit the spatio-frequency redundancy present 
in the data sets. These first generation wavelets are traditionally defined as di- 
lates and translates of one particular L^iR) function, the mother wavelet '0(.),i.e, 

The second generation [3] of wavelets on the other hand are not necessarily 
dilates and translates of a mother wavelet and are still able to enjoy all the powerful 
properties of first generation counterparts. Lifting scheme is a simple construction 
of second generation wavelets. Such wavelets can be adapted to intervals, domains, 
surfaces, weights, and irregular samples. 
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2.1 Definitions 

• Multiresolution analysis : Let L 2 be a general function space of spatial do- 
main with a weighted measure y,. Multiresolutuion analysis M is then defined 
as sequence of close supspaces: M = {Vj C L 2 \ j E J C Z}, such that 

1. V} c 

2 . Ujej dense in L 2 , 

3. for each j E J, Vj has a Reisz bases given by scaling functions {(l>j,k \ 

k E K (j)} 

K{j) can be considered as a general index set. A dual multiresolution analysis 
A'/ is defined in a similar manner with closed spaces Vj with Reisz bases 
scaling functions These dual scaling functions are biorthogonal with 
scaling functions in the sense that, 

( 4>j,k' ) = k,k' for k, k! E K{jf 

• Wavelets : A set of functions { (pj^m I i € J,m E M{j) }, where M{j) = 
K{j -H 1)/K{j) is a set of wavelet functions if 

1. The space Wj = closspan{(j)j,m |e M{j)} is a complement of Vj in Vj+i 
and Wj J- Vj 

2. If J = Z; The set {'ipj,m/Uj,m\\ \ j € J,m E M{j)} is a Reisz basis for 

L^- 

If J = N: The set {ipj,m/\\'^i,m\\ \ j eJ,mE M{j)) U {(/>o,fe/ll<^o,fc|l 1 k € 
K{0)} is a Reisz basis for L 2 . 

Dual basis are given by which are biorthogonal to the wavelets, 
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• Refinefnent Relations : 



ie/cO'+i) 

(2.1) 

l&K(j+l) 

(2.2) 

k 

m 

(2.3) 

Where h{.),g{.), h{,),g{.) are finite filters. 



• Biorhtogonality Conditions : 



- On Scaling and Wavelet functions : 



( ) 

= ^k,k' 

(2.4) 

( ^j,Tni ) 

~ ^m,m' 

(2.6) 

( 4^j,k ) ) 

= 0 

(2.6) 

{ 4^j,k ) 

= 0 

(2.7) 

— On Filters ; Biorthogonality implies following relations for the filters 

^ ^ 9j,m,l9j,m',l 

1 

^ ^ hj,k,l9j,m,l 

1 

— ^m,m' 

(2.8) 

= 0 

(2.9) 

1 

9j,rri,lhj^k,l 

= ^k,k' 

(2.10) 

= 0 

(2.11) 


I 


2.2 Lifting Scheme 

Lifting scheme starts with a simple/trivial multiresolution analysis, and gradually 
is lifted up to a multiresolution analysis with particular properties and hence the 
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the name. As was pointed out earlier the lifting scheme is used for construction of 
second generation wavelets/filters. 

Suppose one has access to a set of biorthogonal filters then 

a new set of biorthogonal filters can be constructed as following 



j old 

- 


(2.12) 


= + 


(2.13) 


— — 
yj,m,l 

k 

(2.14) 

9j,m,l 

— 

9j,m,l 


(2.15) 


The effect of these changes are translated for scaling and wavelet functions as 


follows; 



<Pj,k — 


(2.16) 

II 

1 TTl 

(2.17) 


k 

(2.18) 


1 \ 

(2.19) 


This is the essence of the lifting scheme, with operator S {Sj.a;} we have full 


control over all wavelets and dual functions that can be built from a particular set 
of scaling functions. This means we can start from a simple or trivial multiresolu- 
tion analysis and choose S {sj^k} so that the wavelets after lifting have particular 
properties. This allows customdesign of the wavelet and it is the motivation behind 
the name “lifting scheme.”. Conditions on ^pj^rn thus immediately translate into 
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conditions on S. For example, S can be chosen so as to increase the number of van- 
ishing moments of the wavelet, or such that xpj^rn resembles a particular shape. For 
the sake of example, consider a situation where the number of vanishing moments 
of the wavelets have to be increased. 

For ^j^,n to have vanishing moments such that its integral multiplied to Pp is zero; 

[ PpV’j.mdAi = 0 =» f PpXpf^^d^ = Y^Sj^k,mPp(f>f,tdfl 

Jx Jx ^ 

For fixed indices j and m, this is a linear equation in unknowns {sj^k,m | k G 
K{j,m)]. Mathematical equations are realized in the filter bank framework [13] 
as shown in figure 2.1. 



Figure 2.1: Lifting 


2.3 Dual Lifting 

Analogous to lifting is the concept of dual lifting. The basic idea is the same as for 
the lifting scheme except that now the dual scaling function is not changed. The 
equations for this case are given by 


( 2 . 20 ) 
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yj,m,l — yj,m,l 

PJ, ;::old \r^'^ Zold 

yj,m,l yj,m,l / j 


( 2 . 22 ) 

(2.23) 


And as in the first case these conditions translate on the wavelet and scaling 
functions as; 


^j,k 

m 

(2.24) 


II 

O 

(2.25) 


7Tl 

(2.26) 



(2.27) 


k 


The number of vanishing moments of the wavelet can be increased with the 
lifting scheme, and then the dual lifting scheme can be used to increase the number 
of vanishing moments of the dual wavelet. Desired properties on primal and dual 
wavelets can be achieved by iterating lifting and dual lifting alternately. 


2.4 The Lazy Wavelet 

In this section the Lazy wavelet, one of the initial multiresolution analysis to start 
with, is discussed. The Lazy wavelet transform is an orthogonal transform that 
resamples the coefficients into two groups at each step. That is, if d(n) is the data 
sequence then it samples its odd and even coefficients out in de(n) and do{n). 

A fast wavelet transform can be achieved by first applying a Lazy wavelet 
transform (resampling), then a dual lifting, and finally a regular lifting. The filter 
equations for such wavelet transform are then given by 
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Figure 2.2: Lazy wavelet transform Dual Lifting and Regular Lifting 


^j,k,l — 


Sk,l + E 

m 

E Sj,k,mhj,k,l 

m 


( 2 . 28 ) 

( 2 . 29 ) 


2.5 Predict-Update Architecture 

Previous sections have given introduction to the concept of second generation 
wavelets. Lifting scheme, however can be used to generate a multiresolution anal- 
ysis for the case of first generation wavelets too. This section gives an introdcution 
to the notation that will be used in the report while referring to the lazy wavelet 
and subsequent lifting. The steps involved are: 

• Split: In this step the samples of the data are resampled in odd and even 
components, 

• Predict: Odd samples are predicted using even samples and a predict filter P. 
The Prediction error obtained is stored in the same spatial/temporal location 
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Figure 2.3: Predict-Update Architecture 

as the odd sample predicted. This is same as the lifting step following the 
lazy wavelet transform 

• Update: Prediction errors obtained in the previous step are used to update 
the even samples. This step is same as the dual lifting step following lazy 
wavelet transform and regular lifting. 






Chapter 3 


Adaptive Lifting 

3.1 Introduction 

The purpose of the transformation step in image coding is to represent the im- 
age efficiently with smaller number of coefficients. The wavelet transform provides 
such an efficient representation. Most of the wavelet coefficients of a typical image 
are nearly zero. The reason for the efficiency of the wavelet representation is that 
images often are well modeled as a set of locally smooth regions separated by edges. 
Within these smooth regions, fine-scale wavelet coefficients are small, and coeffi- 
cients decay rapidly from coarse to fine scales. In the neighborhood of edges wavelet 
coefficients decay much more slowly, further the number of coefficients affected by 
the edges goes on increasing across the levels. These large wavelet coefficients near 
edges are expensive to code. A transform, which, in some sense, is edge sensitive, 
changing itself adaptively for transforming different regions of the image will be 
intuitively better then conventional transformation algorithms. Adaptive lifting is 
an extension to the the lifting framework for the wavelet transform. In adaptive 
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lifting a set of linear predictors (Section 3.3) that are chosen adaptively using a 
non-linear selection function. The following section elucidates the above idea in 
more specific terms. 

3.2 Nonlinear Lifting 

The analysis filters used for wavelet compression applications typically correspond 
to fourth order polynomial predictors which work well if the underlying signal is 
locally smooth. However, these predictions break down when the signal is not 
locally well- modeled as a low-order polynomial, i.e., near edges and other sin- 
gularities. Using the lifting framework the algorithm switches between different 
predictors based on the local properties of the image. This gives a simple decision 
criterion, if the image is locally smooth, use higher order predictors and near edges 
use lower order predictor. In other words, for finding the wavelet coefficients on 
one side of the edge, near the edge, the length of the predictor is reduced to avoid 
using coefficients across the edge [1] This finally give us a non linear transform.- 


Signal Discontinuity 3 3 5 7 7 



7 7 7 5 3 3 


Figure 3.1: Figure showing change of filter support size in proximity of an edge 

There are, however some problems associated while adding non-linearity to 
the lifting framework. Updated coefficient are interpreted as the low pass filtered 
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output in a conventional predict-update scheme(Section 2.5). To keep this inter- 
pretation as the low pass filter valid, update filter should be changed with the 
predict filters. One more problem can be serious reconstruction errors because of 
the restrictions imposed by the requirements of a conventional compression algo- 
rithm. The transformed image is quantized and thresholded so as to reduce as 
many coefficients to zero as possible. This however creates severe synchronization 
problems as the decisions of choosing the predictors are based on the data itself and 
if data is modified then the decisions made at the encoder and the decoder shall 
not be the same. Even if the problem of synchronization were solved by having the 
encoder to make its choices based on the quantized value there will still be a ” loop” 
problem left. Prom the figure 3.2, it is observed that to maintain the synchroniza- 
tion predictor decisions have to made based on the quantized data qXe{n). But, 
for this data to be present at the encoder, we should have access to Xo{n) which 
cannot be obtained without obtaining the information about the predictors. This 
leads to inconsistency and hence proves that conventional predict-update structure 
cannot be used. 



Figure 3.2; Update First Architecture 

A solution to this loop problem is using update first architecture. The transform 
thus is carried out by first updating the even samples based on the odd samples 
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yielding the low pass coefficients. These low pass coefficients are reused to predict 
the odd samples, which gives the high pass coefficients. Low pass coefficients 
throughout the entire pyramid linearly depend on the data and are not affected by 
the non-linear predictor. 


3.3 Choice of Filters 


To fit the Update-Predict architecture, (1, N) filters of Cohen-Daubechies-Feauveau 
family [5] are used. The filter coefficients for h and h filters are obtained using 
following equations. 


H{w) = 

H{w) = (cosw/2) 


(1 -f e-*"") 


pjgiw[N/2i 


N 


2 

■fc-i 

(sin w/2) 

n=0 


2n 


(3.1) 

(3.2) 


where H{.) and H{.) are the fourier transform of h{.) and h{.) filters. Further, N 
and N are the vanishing moments of the wavelet and biorthogonal wavelet functions 
respectively and 2k = N + N 
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The Predict filters (P(n)) used axe : 

. ( 1 , 3 ) [ - 1/8 11 / 8 ] 

• ( 1 , 5 ) [ - 3/128 11/64 1 - 11/64 3/128 ] 

• ( 1 , 7 ) [ 5/1024 - 44/1024 201/1024 1 - 201/1024 44/1024 

5/1024 ] 

with Update filter t/(n) = [ 1/2 1/2 jin each one of the cases. 




Chapter 4 
Algorithm 


This chapter discusses the algorithm used to implement the transformation. As 
was discussed in the previous section, the decisions about choosing the appropriate 
predictor from a set of predictors should be same at the encoder and the decoder 
to avoid the synchronization errors. To this end, it should be made sure that (a) 
the data at those spatial locations is available and (b) the data is same as it was 
at the encoder. Former one, seemingly trivial, has important repercussions. 

• Choice of predictors depends on how well the data can be modeled by 
polynomials. To this end, a simple technique is devised. In the set of even 
data points ( to be used for prediction ) the even point closest to the odd point 
to be predicted, is predicted using the rest of the even samples. The error 
hence obtained is checked against some predefined threshold T . Depending 
on the error obtained, the predictors are chosen recursively. The predictor 
thus obtained is used for the final prediction and error storage. This way 
decisions depend on the even samples/scalihg function projection coefficients, 
which in turn makes sure that the decisions at the encoder do not depend on 
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the stored prediction errors. 

• To maintain the synchronization the predictor decisions are based on the 
quantized data, which implies that there has to be some feedback structure 
in the encoder. The need of this feedback structure can be further explained 
by looking at the figure 4.1. 



Figure 4.1: Figure showing inconsistency of Predict-Update approach 

The figure shows a two level decomposition of a one dimensional signal (i(n) .If 
the synchronization has to be maintained at the level two (for obtaining h 2 (n)) the 
decisions should be based on the quantized l 2 {n) rather than hin). Maintaining 
similar conditions at level one, however, needs access to data h(n). As is evident 
from the algorithm structure we need the data h 2 (n) and [ 2 ( 71 ) for evaluating li(n). , 
This explanation makes clear the necessity of having a feedback structure in the 
algorithm. The detailed algorithm for one dimensional decomposition is as follows. 


4.1 One Dimensional Algorithm 

The Pseudo-code for a one dimensional algorithm is as following: 
One-Dim(Input Data d(n), Level Decomposition n) { 


For level k = 1 to n 
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First Stage 

Figure 4.2; One Dimensional Algorithm 

• Update the data 
End 

• Quantize the last level data 

• Store in Q{n) 

For level k = -n to 1 

• Predict( Q{n),lk{n) ) — ^ Afc(n) 

• Quantize( hk{n) ) — >■ hk{n) 

• if ( A; 7 ^ 1 ) (if not at the first level ) 

— One- Step- Reconstruction ( hk{n) ,^{^ 1 ) ) — + ifc-i(n) 
- Store( ik-i{n) ) -^ Q{n) 

End 

} 
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Along Columns 


Figure 4.3: Two Dimensional Algorithm 

4.2 Two Dimensional Algorithm 


The filter in case of two dimensional data/images are considered separable, hence 
filtering is done along the rows and columns separately. The algorithm in case of 
images hence happens to be slightly different and is as explained below. 

Two-Dim ( Input Image Im , level decomposition n ) { 

For level k = 1 to n 

• Update Image Along Rows 

• Update Image Along Even Columns ( Along Path 1 ) 

End 

• This gives imLLn 


• Quantize {im,LLn) qimLLn 
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• Store {qimLLn) — > ImQ 
For level k = n to \ 

• Predict( ImQJmLLk ) — ^ ImLHk 

• Quantize( /mLi/jfc ) — > qlmLHk 

• One-step-reconstruction ( qlmLLk , qlmLHk ) — ^ ImLk 

• Predict ( qlmLk ,ImLk ) — > IraHk 

• Update-Along-Columns ( ImHk ) — ^ ImHLk 

• Quantize( ImHLk ) qlmHLk 

• Predict ( qlmHLk , ImHLk ) -* ImHHk 

• Quantize( Im,HHk) ) — > qlmllHk 

• if ( A: 7 ^ 1 ) (if not at the first level ) 

— One-step-reconstruction ( qlmHLk) ,q ImHHk ) — *■ qlmHk 
- One-step-reconstruction ( qlmLk , qlmHk ) — ^ ImLk-i 
— Store( qlmLk-i ) — >■ Im.Q 


End 

} 
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4.3 Image Coding 

The image is quantized in the transformation step itself. This quantization has 

; 

to be preserved by any coding strategy to be used. In this work modified version 
of SPIHT(Set Partitioning In Hierarchical Trees) is used which is a well known 
algorithm used for coding images. This modification stems from the fact That the 
data is to be reconstructed perfectly from the bit stream the SPIHT generates. 
Following section gives a brief introduction of SPIHT, for details, refer [8] 

4.3.1 SPIHT 

The transformed image is considered to be present in the form of tiles(figure 4.4)^ 
and each coefficient is labeled Cjj, where i,j are the pixel coordinates. Now, the al- 
gorithm selects the coefficients Ci^ such that 2" < |cij| < 2”"^^, with n decremented 
in each pass. Given n, if \cij\ > 2” then the coefficient is considered significant 
else it is called insignificant. The sorting algorithm partitions the coefficients in 
subsets T„i and performs magnitude tests. 

maXi;j^Tm{\ci,j) > 2 "? 

If the answer is no then all coefficients of that set are insignificant otherwise, using 
a rule shared by encoder and deeoder the set Tm is partitioned into new subsets 
Tm,i^ and the significance test is applied to the new subsets. This division process 
continues until the magnitude test is done to all single coordinate significant subsets 
in order to identify each significant coefficient. 

Set partitioning rule should be such that subsets expected to be insignificant 
contain a large number of elements and the subset expected to be significant con- 
tains a single element. This requirement is satisfied by spatial orientation trees. 
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These spatial orientation trees exploit the spatial self similarity of the images by 
defining the nodes of the trees as points which are in same spatial locations in 
higher levels of subbands (figure 4.4). While partitioning the tree the two sub- 



Figure 4.4: SPIHT tree structure 

sets formed are of the immediate descendent’s of the node and the rest of the 
descendants. Since a magnitude ordering is expected in the coefficients as one is 
traversing down the tree, this partitioning algorithm reduces the number of mag- 
nitude comparisons by exploiting spatial sef-similarity. Further, this partitioning 
rule is shgired at the encoder and the decoder so the ordering information need not 
be sent with the data. Those coefficients which become significant in a certain pass 
are further refined by sending there bit planes. This algorithm terminates when 
the bit rate becomes equal to the one specified, but in this case since any quantiza- 
tion is not expected form the SPIHT. The sorting pass, thus terminates when n is 
such that 2" is becomes less then the threshold defined while quantization and the 
refinement pass continues till all the bits of the coefficients that were candidates 
for significance, have been sent. ■ 




Chapter 5 

Results and Discussions 


The aIgorithm( Adaptive (1,7) transform) was used to transform and reconstruct 
test images at various bit rates. The PSNR was then compared with the following 
algorithms. 

• Non-Adaptive{l, 7) transform: This is essentially same as Adaptive(l, 7) 
transform , except for the adaptivity , 

• Daubechies{9, 7) transform: The standard transform used in JPEG200, the 
wavelets and scaling functions have four vanishing moments each, The de- 
tailed mathematical properties of this transform are discussed in [7] 

All the simulations were performed using Matiab, using the following test images : 

• Lena 512 x 512 

• Synthetic image 512 x 512 
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5.1 Results 

5.1.1 Synthetic Image 

The A(laptive(l,7) transform outperformed both the standard Daubechies(9,7) 
transform as well as Non-Ada.ptive(l,7) transform for synthetic images. The results 
are shown for three different bitrates. Figure 5.1 is the synthetic image on which 
the operation of decomposition and reconstruction was performed. As can be seen 
from the image, edges in all the directions are present. The reconstructed image af- 
ter Adaptive(l,7) transform (figure 5.2) can be seen to preserve the edges in a better 
manner. This is attributed to the fact that a large number of wavelet coefficients 
are not effected because of the edges and after tlie quantization/thresholding op- 
eration on the transformed image, the edges do not spill over to neighboring areas 
as. in the case of Daubechies(9,7) transform, see figure 5.4. 
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Figure 5.2; Reconstructed Image of Adaptive(l,7) Transform BPP 0.30 PSNR 
27.974 



Figure 5.3; Reconstructed Image of Non-Adaptive(l,7) Transform BPP 0.30 PSNR 
27.776 
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Figure 5.4; Reconstructed Image of Daubechies(9,7) Transform BPP 0.30 PSNR 
20.832 



Figure 5.5: Reconstructed Image of Adaptive(l,7) Transform BPP 0.50 PSNR 
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Figure 5.6; R.econstructed Image of Non-Adaptive(l,7) Transform BPP 0.50 PSNR 
37.565 


Figure 5.7: Reconstructed Image of Daubechies(9,7) Transform BPP 0.50 PSNR 
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Figure 5.8: 
47.496 


Reconstructed Image of Adaptive(l,7) Transform BPP 0.59 PSNR, 


Figure 5.9: Reconstructed Image of Non- Adaptive (1,7) Transform BPP 0.60 PSNR 
41.508 
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Figure 5.10: Reconstructed Image of Daubechies(9,7) Transform BPP 0.60 PSNR 
37.379 

5.1.2 Natural Image 

Lena 

The Transforms were also compared for the natural images. The results are pre- 
sented here for standard gray scale lena image, figure 5.11. It is again seen that 
the performance of the adaptive transform, figure 5.12 is better then the other 
transforms (figure 5.14, figure 5.13). Here, though the PSNR performance of the 
proposed transform is always better, perceptually the two are comparable. This 
again validates the fact that PSNR performance does not always correspond to 
better perce.ptual appearance. 
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Figure 5.11: Original Lena Image 



Figure 5.12: R.econstructed Image of Adaptive(l,7) Transform BPP 0.29 PSNR 


28.008 
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Figure 5.13: Reconstrticted Image of Non-Adaptivc(l,7) Transform BPP 0.30 
PSNR 23.442 



Figure 5.14: Reconstructed Image of Daubechies(9,7) Transform BPP 0.30 PSNR 
23.751 
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Figure 5.15: Reconstructed Image of Adaptive(l,7) Transform BPP 0.51 PSNR 
31.229 


Figure 5.16: Reconstructed Image of Non-Adaptive(l,7) Transform BPP 0.50 
PSNR 26.228 
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Figure 5.17: Reconstructed Image of Daubechies(9,7) Transform BPP 0.50 PSNR 
27.676 



Figure 5.18: Reconstructed Image of Adaptive(l,7) Transform BPP 0.71 PSNR 
32.297 
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Figure 5.19: Reconstructed Image of Non-Adaptive(l,7) Transform BPP 0.70 
PSNR 28.026 



Figure 5.20: Reconstructed Image of Daubechies(9,7) Transform BPP 0.70 PSNR 


31.198 
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Comparisons 


The plots of Bits per pixel versus PSNR for the three transforms are as shown in 
figures 5.21 and 5.22 



Figure 5.21: PSNR Vs BPP performance comparisons for Synthetic Image 
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Figure 5.22; PSNR Vs BPP performance comparisons for Lena Image 
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Chapter 6 


Conclusions and Future Scope 

In this work the lifting framework was used for adaptive wavelet transforms. The 
Adaptive(l,7) found to perform better then Non-adaptive(l,7) transform in all the 
cases. It even outperformed daubechies(9,7) transform in case of synthetic images. 
In case of natural images, the performance of the Adaptive(l,7) is comparable to 
the standard daubechies(9,7) transform and as in the case of synthetic images, 
much better than Non-adaptive(l,7) transform. Proposed transform performs dis- 
tinctively better hear the edges. This is because of algorithms’ ability to change 
the filter size which make sure that not a very large number of coefficients are 
affected because of the edges. 


Future Scope 

• In this work the adaptivity was added to a (1,7) transform. If this can be 
extended to Daubechies(9,7) transform the results can be expected to be 
better. 
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• The idea of adaptive lifting can be extended to wavelet packet decomposition 

• The idea can be extended to 3-D video coding. 

• The bit stream if subjected to errors severely degrades the image. This is 
because these errors propagate across the scaled in form of incorrect filter 
decisions. Some robust error protection should be used 
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